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Amendments to the Specification: 

Please replace the Specification with the following: 

METHOD AND SYSTEM FOR REDUCED LOCK CONTENTION 
IN SQL TRANSACTIONS 

CROSS-REFERENCE TO RELATED APPLICATIONS 

This application claims benefit under 35 USC 1 19 of European Application No. 
00124197.5 filed on November 8, 2000. 
BACKGROUND OF INVENTION 

The present invention relates to database management systems and more particularly to 
improving throughput of transactions by reducing lock contention of concurrent transactions on 
the rows of a table in a relational data base system. 

The rules that influence SQL result set creation semantics are defined in the SQL standard 
(ANSI Document X3. 135-1986, "Database Language SQL, American Standards Institute, New 
York, 1986; ISO Document ISO/TC 97/SC, 21/WG 3, N 1 17). These rules comprise locking 
isolation levels which determine when an application program can retrieve data during a 
transaction. In general, the rules specify whether the result set consists of the committed rows 
only or whether it can include the rows that were uncommitted at the time of result set creation. 
Typical examples of the former are Cursor Stability (CS) and Read Stability (RS), and for the 
later Uncommitted Read (UR). 

Many applications require committed read semantics which inevitably comes with the 
price of increased locking contention. In order to return a committed row the transaction might 
need to acquire a lock on it that is incompatible with locks requested by other concurrent 
transactions. With the known locking semantics the committed read transactions might requests 
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and held locks even on the rows that do not qualify for the result set by means of the statement 
predicates. Apart from the overhead of acquiring and releasing these seemingly unnecessary 
locks this behavior increases the number of lock request suspensions and in some cases results in 
deadlocks and abnormal transaction terminations. 

Accordingly, what is needed is a system and method for improving the throughput of 
transactions. In particular, what is needed is a system and method for reducing lock contentions 
in operations that require committed read semantics, and reducing deadlock situations in 
transactions. The present invention addresses such a need. 
BRIEF SUMMARY OF THE INVENTION 

The present invention is directed to a method, apparatus and computer readable medium 
for reducing lock contention of concurrent transactions. According to the invention, a scan is 
performed by accessing all rows of a table in a relational database, evaluating each row to 
determine whether the row satisfies the predicates of a database query, and returning the row if it 
satifies the predicates of the query. According to a preferred embodiment of the present 
invention, all rows are evaluated regardless of current locks. The scan is continued if the row 
does not satisfy the predicates of the query. A lock is requested if the row satisfies the predicates 
of the query, and the scan is suspended if the requested lock is refused. The request is repeated 
and the row is re-evaluated when the lock is permitted. If the row still satisfies the predicates of 
the query, it is returned . If the row no longer satifies the predicates of the query, the lock is 
released, the row is skipped and the scan is continued. 

The locking semantics according to the invention has the advantage that it completely 
removes the problem of lock contention on the rows that do not satisfy query predicates. The 
improved semantics of forming the result set is more strict than the Uncommitted Read (UR) 
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mode because it returns only committed rows, but is less strict than Cursor Stability (CS) and 
Read Stability (RS) because in some special cases the rows that would be returned by CS and RS 
as committed would not be part of the result set. 

BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS. 

Figure 1 shows an example of the access to a table using the conventional read semantic 

rules. 

Figure 2 shows the example from Figure 1 by the use of read semantic rules according to 
the invention. 

Figure 3 is a schematic block diagram of a data base management system including 
apparatus using an implementation of the invention. 

Figure 4 is a flow diagram of an implementation of a method according to the invention. 
DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to improving the throughput of transactions. The following 
description is presented to enable one of ordinary skill in the art to make and use the invention and 
is provided in the context of a patent application and its requirements. Various modifications to the 
preferred embodiments and the generic principles and features described herein will be readily 
apparent to those skilled in the art. Thus, the present invention is not intended to be limited to the 
embodiments shown but is to be accorded the widest scope consistent with the principles and 
features described herein. 

The embodiments of the invention described herein relate to a relational database 
operated by a database management system (DBMS) by using the well known Structured Query 
Language called SQL. The DBMS supports a plurality of application programs which may 
simultaneously request access to the database. It is one of the functions of the DBMS to 
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coordinate transactions initiated by the application programs, and to avoid interferences between 
them. The access to the data base is performed by using the SQL Standards which provide a 
variety of statements including search conditions or predicates which allow a user to define 
certain data items in the database. One of these statements is the Select Statement which 
produces a result set of data according to the search conditions or predicates. 

The invention introduces improved semantics to create result sets in response to queries 
directed to the relational database by the use of access statements as defined in the SQL 
Standards. By the improved semantics, access to a table of the database is performed by a scan of 
all rows of the table, both committed and uncommitted, regardless of existing locks, and by a 
return of committed rows which fulfill the query predicates. While both committed and 
uncommitted rows are scanned, the entire table need not necessarily be scanned. This still 
depends on the selected access path and is transparent to the new semantics. 

The algorithm that describes the improved semantics comprise the following rules: 

1 . The rows of a table are scanned with uncommitted read semantics. 

2. If a row does not satisfy the predicates, it is skipped and the scan is continued, 
else an appropriate lock is requested. 

3. If the requested lock is refused, the scan is suspended. Once the suspension is 
lifted, the row is re-evaluated to confirm that the row still satisfies the predicates. 

4. If the row no longersatisfies the predicates, the lock is released, the row is skipped 
and the scan is continued. Otherwise, the row is returned as part of the result set and the scan is 
continued. 

All rows of the result set are committed rows. Locks on rows that do not satisfy the 
predicates are neither requested nor acquired, which eliminates the situations where the lock 
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requester waits for a lock on the row that is not needed and avoids situations where the lock 
holder blocks other transactions by holding a lock on the row that is not part of the holder's result 
set. 

The result sets created by the existing semantical rules and the semantical rules according 
to the invention are in most cases the same. There is one exception which is demonstrated by the 
following sequence of events. 

The scan needs to visit a number of rows and it starts scanning at the time when a row R, 
that yet needs to be scanned, satisfies the predicates. Before the scan reaches row R, another 
transaction changes its contents so that row R no longer satisfies the predicates used by the scan. 
This row updating transaction has not yet become committed. At this point in time the scan 
comes to row R and skips it because it does not satisfy the predicates used by the scan. 
Thereafter, for some reason, the updating transaction backs out the changes to row R so that it 
would now satisfy the predicates used by the scan. The row R would thus not be part of the result 
set that is created by the scan although it satisfies the predicates. 

In general, the new semantics provide the same rules for the result set creation as the 
known isolation level semantics including the Cursor Stability mode or the Read Stability mode, 
with the exception of the special case described above. There are many applications that can 
tolerate these exceptions for the benefit of significantly reduced lock contention. 

An appropriate implementation of the new semantics is a modification of the known 
committed read isolation levels because it does not change their 'repeatability 1 attribute, but rather 
changes the way the result set is constructed. 

Figures 1 and 2 demonstrate in general the differences between the conventional 
semantics and the semantics according to the invention. Figure 1 represents the access to table 
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TAB A by using the conventional semantics. Table TAB A comprises a number of rows R n and 
which is accessed by two statements SI and S2. For a given range, only row 1 1 satifies the 
predicates of statement SI . At time ti 9 statement SI reads row 10 adjacent to row 1 1 and skips it 
because it does not qualify. At time t2, statement SI reads the next row, which is row 1 1, and 
retrieves it because it qualifies. At the same time statement S2 updates the subsequent row 12 
and holds an exclusive lock on this row. At time t3, statement SI comes to row 12 that is held by 
statement S2 and thus gets suspended. SI waits then for S2 to release the lock, even though row 
12 does not qualify. At time t4, statement S2 commits and releases the exclusive lock on row 12. 
Statement SI then reads row 12 and skips it because it does not qualify. 

Figure 2 represents the scanning with locking semantics according to the invention. The 
operation at times ti and t2 is the same as shown in figure 1 . At time t 3 , however, when Statement 
SI reaches row 12, which is held by statement S2, Statement SI does not not get suspended. 
Rather, Statement SI reads through the exclusive lock held by statement S2 and finds that the 
row does not qualify and skips it. At time t*, statement SI is already further in the table by 
reading row 13. The result set produced by statement SI is the same as in the conventional 
semantics, but without locking overhead and contention. 

Figure 3 shows an implementation of the invention in a database management system 
(DBMS) 30 which comprises a scanner 31 and a system of tables 32. The scanner receives Select 
Statements via line 33 and scans the rows of at least one of the tables 32. A read buffer 35 at the 
output of the table storage is connected to a qualifying checker 36 which determines whether a 
row read within the scan operation satisfies the predicates of the SQL statement received on line 
33. A lock control device 37 receives an input from the qualifying checker 36 if the later 
determines that the row satisfies the predicates. The lock control device 37 also receives inputs 
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from other components, not shown in figure 3. The scanner 32 is connected to a lock device 38 
which disregards pending locks held by other access statements and which permits a read through 
such locks during the scan of the table 32 as described above with reference to figure 2. 

Figure 4 represents the steps of a method to implement the algorithm described above. In 
step 40 an SQL statement is issued which initiates by step 41 a read scan in a table such as table 
32 in figure 3. This scan may be performed within an access range determined by the statement. 
During the scan each row of the table is read and evaluated in step 42 to determine whether the 
row fulfils the search conditions of the select statement. In the case that it does not satisfy the 
search conditions, the row is skipped and step 41 is repeated to continue the scan by reading the 
next row. If the row satifies the search conditions, a lock is requested in step 43 . In step 44, it is 
determined whether the lock is accepted. If the lock is accepted in step 45, the row is returned as 
a part of the result set of the scan, and the scan continues with the next row until the end of the 
scan range. If the requested lock is refused (step 44), i.e., a non- compatible lock of another 
statement is encountered, the scan is suspended in step 46. Subsequently, in step 47, when the 
blocking lock is released, the scan is resumed. In Step 48, the same row is evaluated again to 
determine whether it continues to satisfy the predicates. If the row still satisfies the statement 
predicates, the row is returned as a part of the result set in step 45. If the row no longer satisfies 
the predicates, it is skipped, and step 41 is repeated to continue the scan. 

The method steps described by reference to figure 4 may be implemented in a program 
product which is stored on a machine readable data carrier or transmitted through a digital data 
network. 

Example 1 
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Subsequently, examples are described which show the similarities and the differences of 
the current isolation level semantics and the semantics according to the invention. The examples 
use the following table TAB: 

Table TAB 

Rows Column CI Column C2 

Row 1 3 A 

Row 2 5 B 

Row 3 10 C 

In order to avoid complexity of considering different access paths (which is irrelevant for 
the presented examples) it is assumed that all rows need to be examined, i.e. either the statements 
below use tablespace scan, or use an index scan and end up with the three rows to examine 
without further index usage. 

There are two transactions, TRAN1 and TRAN2 that hold locks on Table TAB. The locks 
are acquired via the following statements: 

Tranl: UPDATE TAB SET C2=P WHERE CI =3 

The result of this statement is an exclusive lock on rowl . 

Tran2: UPDATE TAB SET C2=Y WHERE Cl=10 
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The result of this statement is an exclusive lock on row3. 
Tran3: SELECT ... FROM TAB WHERE Cl=10 AND C2=Y 

TRAN3 accesses Table TAB by using the known committed read isolation levels 
semantics. The scan starts by a request of a shared lock on rowl and gets suspended since at that 
time rowl is exclusively locked. 

Once the transaction TRAN1 commits, the shared lock on rowl will be granted to 
TRAN3. After evaluating the predicates the row will be rejected, but the lock might stay 
depending on the isolation level and internal database system implementation. The scan 
continues in the same fashion and reaches row3. A shared lock, is requested and the transaction 
gets suspended. Whether or not row3 gets included in the result set depends on TRAN2 
completion. If TRAN2 commits, row3 is in the result set. If it backs out, row3 is not in the result 
set. 

The behavior of TRAN3 is different, if the new semantics is used. The scan starts and 
rejects rowl and row2 without locking since they do not fulfil the predicates of the Select 
statement. The scan avoids overhead of acquiring locks on non-qualifying rows, transaction wait 
due to the lock suspension, potential timeout or deadlock as well as unnecessary locks that stay 
behind. When the scan reaches row3, TRAN2 already has changed column C2 of this row from 
'C to 'Y\ Thus, row 3 satisfies the predicates of the select statement in TRAN3 so that a shared 
lock is requested and the transaction gets suspended. After TRAN2 completes, by commit or by 
backout, the predicates are re-examined. Whether or not row3 gets included in the result set 
depends on TRAN2 completion. If it commits, row3 is in the result set. If it backs out, row3 is 
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not in the result set. Note that there are no differences in the result sets between the current 
isolation level semantics and the semantics according to the invention. 

Example 2 

This example relates to the same table TAB. It demonstrates the case where the current 
isolation level semantics and the semantics according to the invention generate different result 
sets. The table TAB and the transactions TRAN1 (holding an exclusive lock on rowl) and 
TRAN2 (holding an exclusive lock on row3) are the same. 

TRAN3 accesses Table TAB via the statement: 

SELECT ... FROM TAB WHERE Cl=10 AND C2=C 

TRAN3 uses the existing committed read isolation levels semantics. The scan starts, 
requests a shared lock on rowl and gets suspended. Once the transaction TRAN1 commits, the 
shared lock on rowl will be granted to TRAN3. After an evaluation of the predicates the row will 
be rejected, but the lock might stay depending on the isolation level and internal database system 
implementation. The scan continues in the same fashion and reaches row3. A shared lock is 
requested and the transaction gets suspended. Whether or not row3 gets included in the result set 
depends on TRAN2 completion. If TRAN2 commits, row3 is not in the result set. If it backs out, 
row3 is in the result set. 

The behavior of TRAN3 is now considered if the semantics according to the invention is 
used. The scan starts and rejects rowl and row2 without locking since they do not satisfy the 
Select statement. As in example 1, the locking overhead and contentions are avoided. If the scan 
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reaches row3 5 this row does not qualify (it currently has values Cl=10 and C2=Y). Regardless of 
whether TRAN2 later commits or backs out, the row3 is not included in the result set. 

Although the present invention has been described in accordance with the embodiments 
shown, one of ordinary skill in the art will readily recognize that there could be variations to the 
embodiments and those variations would be within the spirit and scope of the present invention. 
Accordingly, many modifications may be made by one of ordinary skill in the art without departing 
from the spirit and scope of the appended claims. 



-12- 



